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Preface 



The purpose of writing this book is to suggest some improved estimators using 
auxiliary information in sampling schemes like simple random sampling, systematic 
sampling and stratified random sampling. 

This volume is a collection of five papers, written by nine co-authors (listed in the 
order of the papers): Rajesh Singh, Mukesh Kumar, Manoj Kr. Chaudhary, Cem Kadilar, 
Prayas Sharma, Florentin Smarandache, Anil Prajapati, Hemant Verma, and Viplav Kr. 
Singh. 



In first paper dual to ratio-cum-product estimator is suggested and its properties are 
studied. In second paper an exponential ratio-product type estimator in stratified random 
sampling is proposed and its properties are studied under second order approximation. In 
third paper some estimators are proposed in two-phase sampling and their properties are 
studied in the presence of non-response. 

In fourth chapter a family of median based estimator is proposed in simple random 
sampling. In fifth paper some difference type estimators are suggested in simple random 
sampling and stratified random sampling and their properties are studied in presence of 
measurement error. 

The authors hope that book will be helpful for the researchers and students who are 
working in the field of sampling techniques. 
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Abstract 

Tracy et al.[8] have introduced a family of estimators using Srivenkataramana and Tracy 
([6], [7]) transformation in simple random sampling. In this article, we have proposed a dual 
to ratio-cum-product estimator in stratified random sampling. The expressions of the mean 
square error of the proposed estimators are derived. Also, the theoretical findings are 
supported by a numerical example. 

Key words: Auxiliary information, dual, ratio-cum-product estimator, stratified random 
sampling, mean square error and efficiency. 

1. Introduction 

In planning surveys, stratified sampling has often proved as useful in improving the precision 
of un-stratified sampling strategies to estimate the finite population mean of the study 
_ 1 L N h 

variable, Y = — £ XYhi • Let y, x and z respectively, be the study and auxiliary variates 
N h=li=l 

on each unit Uh (h=l,2,3, — , N) of the population U. Here the size of the stratum Uh is Nh, 
and the size of simple random sample in stratum Uh is nh, where h=l, 2, — ,L. In this study, 
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under stratified random sampling without replacement scheme, we suggest estimators to 
estimate Y by considering the estimators in Plikusas [3] and in Tracy et al. [8]. 



To obtain the bias and MSE of the proposed estimators, we use the following notations in the 
rest of the article: 

y* = 2h=iw h y h =Y(l + e,j), 

= £h=iW h x b =X(l + e 1 ), 

Zst = 2h =1 W h z h = z(l+e 2; ). 



where, w h = — -. 
h N 



Such that, 

E(e a ) = ECeJ = E(e 2 ) = 0, 



rst 



L 

-I' 

h=l 



rHfs+t 
w h 



E[(y h - Y h m - x h ) B (z h - ZJI 



YTiFZ 1 



( 1 ) 



where y h and Y h are the sample and population means of the study variable in the stratum h, 
respectively. Similar expressions for X and Z can also be defined. 



Using (1), we can write 



E(eg) 



_ 



2h=l w hYfa S yh 

Y 2 



^200 > 



E(«D = 



2fa=i w bYh s *h 

X 2 



— ^020' 
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a> n=^yA 

E ( e 2) = « 



— ^002' 



E ( e o e i) = 



^b = 1 ^ h Yh ^xyh 



XY 



= V. 



110' 



c f ^ = 1 ^ h Yb ^xzh TT 

E(e a e 2 J = = = V 011J 



XZ 



where 



S^fo-Tt.) 2 



q2 

b yh 



N h -1 



, 2 ^,(^~Xb) 3 

^r-tb tlt vi - 1 b zb 



S^^b-Xb) 2 o2 S^db-Zb) 2 



Nfa — 1 



N b -1 



. . S^^h-XbXyh-Yh) _ _ S;; t ,(yh-Yi,)(z 1 ,-Zi,) 

^■^rr^lns I ^Trrk 



,N t _ v 



3 syb 



Nu — 1 



3 yzb 



Nu - 1 



_SSi(^h-Xfa)(zh-Zh) 



S .izh — 



Nu — 1 



1- fu 



nu 



Nu 



Nu 



£] r El [J 

Yh = — 1 f h = IT ' w h - "TT 1 



N 



The combined ratio and the combined product estimators are, respectively, defined as 

y l 



Y2 



-'-(S' 

-'■ft) 



( 2 ) 

(3) 



And the MSE of y± and y 2 to the first degree of approximation are, respectively, given by 



MSE(yJ * Y 2 (V 2QO + V 020 - 2V 110 ) (4) 

MSE(y 2 ) * Y 2 (V 200 + V Q2Q + 2V 110 ) (5) 

Note that Y = Y^ = 2^ =1 w h Y h . Similar expressions for X and Z can also be defined. 

2. Classical Estimators 
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Srivenkataramana and Tracy ([6], [7]) considered a simple transformation as 



u i — A — Xj, ( i=l, 2,...,N) 

=> u = A — x, 

where A is a scalar to be chosen. This transformation renders the situation suitable for a 
product method instead of ratio method. Clearly u st (= A — is unbiased for U(= A — X). 

Using this transformation, an estimator in the stratified random sampling is defined as 

y 3 = y* (if) ( 6 ) 

This is a product type estimator ( alternative to combined ratio type estimator) in stratified 
random sampling. 

The exact expression for MSE of y 3 is given by 

MSE(y 3 ) = Y 2 (V 200 + 9 2 V(j 20 - 20V 11O ) (7) 

X 

where 9 — — , 

(A-X) 

In some survey situations, information on a second auxiliary variable, Z, correlated 
negatively with the study variable, Y, is readily available. Let Z be the known population 

mean of Z. To estimate Y , Singh[4] considered ratio-cum-product estimator as 
_ fX\ /z\ 

where Perri[2] used t x =x + a(x-x) and t 2 =z + p(z-z) instead of x and z, 
respectively. Here, a and p are constants that make the MSE minimum. 
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Adapting y 4 to the stratified random sampling, the ratio cum product estimator using 



two auxiliary variables can be defined as 

^ =? “(£)(t) (a) 

The approximate MSE of this estimator is 

MSE(y 5 ) * Y 2 [V 20(J + V 020 + V oaz + 2(V 101 - V 110 - V 011 )] (9) 

3. Suggested Estimators 

Tracy et al. [8] introduced a product estimator using two auxiliary variables in the simple 
random sampling given by 

<*•> 

Motivated by Tracy et al. [8], we propose the following product estimator for the 
stratified random sampling scheme as 

»■-'■©(¥) <>» 

Expressing y 7 in terms of e’s, we can write (1 1) as 

y? = Y(l + e 0 )(l — 0e 1 )(l + e 2 ) 

The MSE(y 7 ) to the first order of approximation, is given as 

MSE(y 7 ) = Y 2 [V m + B 2 V a2Q + V aa2 - 2(0V 11O - V 1Q1 + 9V ail )] (12) 



and this MSE equation is minimised for 



Note that the corresponding A is 



A, 



opt 




By putting the optimum value of 0 in ( 12 ), we can obtain the minimum MSE equation 
for the first proposed estimator, y 7 . 

Remark 3.1 : The value of X is known, but the exact values of Vno, Von and V020 are 
rarely available in practice. However in repeated surveys or studies based on multiphase 
sampling, where information is gathered on several occasions it may be possible to guess the 
values of V110, V011 and V020 quite accurately. Even though this approach may reduce the 
precision, it may be satisfactory provided the relative decrease in precision is marginal, see 
Tracy et al. [ 8 ]. 

Plikusas[ 3 ] defined dual to ratio cum product estimator in stratified random sampling 



Considering the estimator in ( 13 ) and motivated by Singh et al. [ 5 ], we propose a family of 
dual to ratio cum product estimator as - 



as 




( 13 ) 



where 



Kt = (1 + EhK) - z* = (1 + Sh Z h) ~3h z h- 
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- - ( 2 ' 
y ’ = MTi 



( 14 ) 



To obtain the MSE of the second proposed estimator, y 9 , we write 



y Et = Y(l + e 0 ), 



x^ = (l + e*>. 



z^ = (l + e |). 



Expressing (14) in terms of e’s, we have 



y 9 = Y(l + e 0 )(l+e')<Ml + ^)-“* 



(15) 



Expanding the right hand side of (15), to the first order of approximation, we get 



(y 9 “ Y) = Y[e a + a 1 e 0 e 1 - d 2 e,je 2 + d^ -a 2 e 2 - a^e!^ 



+ 



a i( a i ~ 1) f2 , « 2 (a 2 +1) , 



■ei + 



■2] 



(16) 



2 2 

Squaring both sides of (16) and then taking expectation, we obtain the MSE of the second 
proposed estimator, y 9 , to the first order approximation, as 



MSE(y 9 ) £ Y a {Vj ro + a 2 V' 20 + a 2 V' 02 + 2(d 1 V' 19 - - d 2 V' 01 » (17) 



where 






wr+s+t (_g) B+ t E( - ii _ Yh)( - h _ Xh)( - h _ 2h) 



b = l 



Y r X s Z t 



( 18 ) 



This MSE equation is minimized for the optimum values of and d 2 given by 



]/' if 1 — v' v' 

, _ p ioi ''oil Tio v m2 



a i = 



V' V —V 2 

^ 020^002 ''Oil 



(19) 
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( 20 ) 



v' v — v v 

r ' 02 0 *101 * 11 0^011 

CCn = 

* V* V — V 2 

'02 0 *002 *011 

Putting these values of and a 2 i n MSE (y g ), given in (17), we obtain the minimum MSE 
of the second proposed estimator, y g . 

4. Theoretical Efficiency Comparisons 

In this section, we first compare the efficiency between the first proposed estimator, y 7 , with 
the classical combined estimator, y^, as follows: 

MSE(y 7 ) < 7(yJ 

Y 2 [V 200 + 0 2 V O2(J + V 002 - 2(0V 11(J - V 101 + 0V Q11 J] < Y 2 V 20lJ . 

The estimator y 7 is better than the usual estimator y^, if and only if, 



JL 

2B 2 



< 1 , 



(21) 



where, B t — 0 V 020 + V 002 and B 2 — 0V 11O V 1O1 + 0V O11 . 

If the condition (21) is satisfied, the first proposed estimator, y 7 , performs better than 



the classical combined estimator. 



We also find the condition under which the second proposed estimator, y 9 , performs 
better than the classical combined estimator in theory as follows: 

MSE(y g ) < V(yj, 

Y-{V 200 + txjV(j 2 Q + ®%yoa 2 "b 2(ot 1 V 110 — tx 1 a. 2 V (] , ;L1 — o^Vkh)} < Y“V 2 qq, 
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^ 1^020 ^ 2 ^ 0(12 ^ 0t l 0E 2 ^ r ( 

0t 2^1Ql _ OlViio 



l u 2 * Oil 



< 1, 



The estimator y 9 is better than the usual estimator y^, if and only if, 



C 



( 22 ) 



where, C ^ 1^020 ^ ^ 2^002 2ct^ &nd B ^-2 ^101 ^l^m' 

5. Numerical Example 

In this section, we use the data set earlier used in Koyuncu and Kadilar[l] . The population 
statistics are given in Table 1. In this data set, the study variable (Y) is the number of 
teachers, the first auxiliary variable (X) is the number of students, and the second auxiliary 
variable (Z) is the number of classes in both primary and secondary schools for 923 districts 
at 6 regions ( as 1: Marmara, 2: Agean, 3: Mediterranean, 4: Central Anatolia, 5: Black Sea, 
6: East and South east Anatolia) in Turkey in 2007, see Koyuncu and Kadilar[l], Koyuncu 
and Kadilar[l] have used Neyman allocation for allocating the samples to different strata. 
Note that all correlations between the study and auxiliary variables are positive. Therefore, 
we decide not to use product estimators for this data set for efficiency comparison. For this 
reason, we apply the classical combined estimator, y rt , combined ratio estimator , y 1? the 

ratio-cum-product estimator, y 5 , Plikusas [3] estimator, y B , and the second proposed 
estimator, y 9 , to the data set. For the efficiency comparison, we compute percent relative 
efficiencies as 



PRE(y ± ) = 




i — st, 1,5,Q,9- 
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Table 1 . Data Statistics of Population 



Ni=127 

N 4 =170 

ni=31 

n 4 =38 

S yl = 883.835 
S y4 = 810.585 
Yj = 703.74 
Y 4 =424.66 
S xl =30486.751 
S x4 =18218.931 
X ± =20804.59 
X 4 =9478.85 
3^ =25237153.52 
= 14523885.53 
Pxyl = 0.936 
Pxy4 = 0.983 
S zl =555.5816 
S 24 = 458.0282 
Z L = 498.28 
Z 4 = 498.28 
= 480688.2 
s yzi = 364943.4 
S., = 15914648 
s *z4 = 8041254 



N 2 =117 

Ns=205 

M2=2 1 
ns=22 

S y2 = 644 
S y5 = 403.654 
% =413 
Y s = 267.03 
S x2 =15180.760 
S x5 =8997.776 
X 2 =9211.79 
X 5 = 5569.95 
S^ 2 =9747942.85 
S^i =3393591.75 
PxyZ = 0.996 
Pxy5 = 0.989 
S 2Z = 365.4576 
S z5 =260.8511 
Z 2 = 318.33 
Z 5 = 227.20 
s yzz = 230092.8 
S yzl = 101539 
3^2 = 5379190 
S*z5 = 2144057 



N 3 =103 

Ne=201 

n 3 =29 

( 16=39 

S y3 = 1033.467 
S y6 =7 11.723 
Y 3 = 573.17 
Y 6 =393.84 
S x3 =27549.697 
S x6 =23094.141 
X 3 =14309.30 
X 6 =12997.59 
S :ty3 =28294397.04 
=15864573.97 
Pxy3 =0-994 
Pxy6 = 0.965 
s z3 =612.9509281 
S z& = 397.0481 
Z 3 =431.36 
Z 6 = 313.71 
S T/Zl = 623019.3 
s yzi =277696.1 
3^3 = 164900674.56 
S Ml = 8857729 
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Pyzl = 0.978914 
Pyz4 = 0.982958 



Pyz2 = 0.9762 
Pjrz5 = 0.964342 



p y3 =0.983511 
Pyzl = 0.982689 



Table 2. Percent Relative Efficiencies (PRE) of estimators 



Estimators 


Values of 


Values of ct 2 


PREfr) 


y* 


0 


0 


100 


?i 


1 


0 


1029.469 


y 5 


1 


1 


149.686 


y 3 


1 


1 


115.189 


MSE(y g )min 


6.2918 


-0.8870 


2854.549 



Table3. The MSE values according to A 



Value of 0 


Corresponding value of A MSE (y 7 ) 


<0.8 


- 


>V(yst) 


0.8 


25779.79 


2186.879 


0.9 


24188.44 


1814.999 


1.00 


22915.37 


1492.895 


1.10 


21873.76 


1220.564 


1.20 


21005.75 


998.009 


1.30 


20271.29 


825.227 


1.40 


19641.74 


702.221 


1.50 


19096.14 


628.989 


1.5971 (opt) 


18631. 62(opt) 


605.511* 


1.60 


18618.74 


605.532 


1.70 


18197.50 


631.849 


1.80 


17823.06 


707.941 


1.90 


17488.04 


833.807 


2.00 


17186.53 


1009.448 


2.10 


16913.72 


1234.864 


2.20 


16665.72 


1510.054 


2.30 


16439.29 


1835.019 


2.40 


16231.72 


2209.758 


>2.40 


- 


>V(yst) 



* MSE (min) at the value A(optimal). 
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6. Conclusion 



When we examine Table 2, we observe that the second proposed estimator, y 9 , under 

optimum condition certainly performs quite better than all other estimators discussed here. 
Although the correlations are negative, we also examine the performance of the first proposed 
estimator, y 7 , according to the classical combined estimator. Therefore, for various values of 

A and 0 in Table 3, the MSE values of and y 7 are computed. From Table 3, we observe 
that the first proposed estimator, y 7 , performs better than the estimator, y rt , for a wide range 
of 9 as 9et0.8j2.40], even in the negative correlations. 
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